Novel fibronectin-binding protein

ABSTRACT

The present invention is concerned with a novel fibronectin-binding protein of  Streptococcus equi,  to a DNA fragment encoding this protein, to host cells and vectors containing said DNA fragment and to methods to produce said protein based on recombinant DNA technology.  
     The invention is also related to use of said protein in the preparation of a vaccine, to a vaccine containing said protein, to antibodies specific for said protein and to polyvalent antisera containing such antibodies.

[0001] This application is the national phase under 35 U.S.C. §371 of PCT International Application No. PCT/SE99/02448 which has an International filing date of Dec. 21, 1999, which designated the United States of America.

[0002] The present invention is generally related to a novel protein, methods to produce said protein and use thereof, e.g. for immunization purposes.

BACKGROUND OF THE INVENTION

[0003] 1. Field of the Invention

[0004] More specifically, the present invention is related to a novel fibronectin-binding protein derived from a bacterium belonging to the genus Streptococcus, to a DNA sequence encoding said protein, recombinant DNA methods for the production of said protein, and use of said protein per se or a fragment thereof as an immunogenic protein or antigenic polypeptide or peptide, e.g. for use as an active component in a vaccine, or to produce antisera.

[0005] 2. Description of the Related Art

[0006] Streptococcal infections in horses are mainly caused by the species Streptococcus equi, which is classified as a Lancefield Group C Streptococcus and comprises two subspecies designated equi and zooepidemicus, respectively. Streptococcus equi subsp. equi which is virtually confined to horses is the causative agent of strangles, a world-wide distributed and serious disease of the equine upper respiratory tract. Since strangles is a highly contagious disease, not only infected animals but also all other members of an afflicted stud must be isolated for as long as up to three months.

[0007]S. equi subsp. zooepidemicus, is considered as an opportunistic commensal often occurring in the upper respiratory tract of healthy horses. However, after stress or virus infection, it can cause a secondary infection, which results in strangles-like symptoms. Moreover, subsp. zooepidemicus infects not only horses but also a wide range of other animals, like pigs, dogs, cats, and cows. Even human cases with infection due to subsp. zooepidemicus have been reported. This subspecies has been implicated as the primary pathogen in conditions such as endometritis, cervicitis, abortion, mastitis, pneumonia, abscesses and joint infections.

[0008] Although it is possible to treat and cure these streptococcal infections with antibiotics, such as penicillin, tetracycline or gentamicin, an effective prophylactic agent, that could prevent outbursts of such infections and obviate or reduce the risk for development of resistant strains associated with antibiotic treatment, would be appreciated.

[0009] However, although many attempts have been made to develop prophylactic agents such as vaccines against S. equi, at the present time no efficient vaccines or immunizing preparations are available, neither for the subspecies equi nor for the subspecies zooepidemicus.

[0010] Existing vaccines against strangles are based on inactivated, e.g. heat-killed, or attenuated strains of S. equi subsp. equi or acid extracts/mutanolysin enriched in M-protein(s), i.e. immunogenic protein(s) produced by S. equi. A vaccine against S. equi subsp. zooepidemicus based on an M-like protein is disclosed in U.S. Pat. No. 5,583,014.

[0011] Since the previously developed vaccines or immunizing preparations are hampered by side-effects and, moreover, provide insufficient protection, there is a need for efficient prophylactic agents, such as vaccines, that protect against S. equi infections and/or prevent spread thereof.

[0012] It is well known that attachment to eukaryotic cell surfaces is an essential step in the establishment of infection and colonization by bacterial pathogens. Accordingly, streptococcal surface proteins, that interact with and/or bind to different components of the Extracellular

[0013] Matrix (ECM) or plasma proteins of the host cell, are potential candidates for use as active component(s) for immunizing purposes.

[0014] This is illustrated by the vaccines based on M-like proteins mentioned above or disclosed in the literature, i. a. in WO 98/0561. The binding of fibrinogen and complement factor H to M-proteins is assumed to be important for the ability of streptococci to resist phagocytosis by polymorphonuclear leucocytes.

[0015] Another mechanism used by streptococci for attachment to host cells involves binding to the ECM component fibronectin (Fn) (3, 4). Binding between Fn-binding bacterial cell-surface proteins and immobilized Fn promotes internalization of streptococci by epithelial cells (1, 7, 11). Fibronectin is a dimeric glycoprotein found both in plasma and in a fibrillar form in the extracellular matrix. The main function of Fn is to mediate substrate adhesion of eukaryotic cells, which involves the binding of specific cell-surface receptors to certain domains of the Fn molecule (5). Furthermore, it also interacts with several other macromolecules, such as DNA, heparin, fibrin, and collagen (5).

[0016] Accordingly, several Fn-binding proteins from different streptococcal species have been cloned and sequenced previously. From S. equi, one Fn-binding protein has been cloned and characterized earlier, which is a Fn-binding cell-surface protein of subsp. zooepidemicus, that has been designated FNZ (9).

[0017] Recently, a novel gene encoding a Fn-binding protein has been cloned from S. equi subsp. equi. The encoded protein, is clearly distinguishable from the previously isolated streptococcal Fn-binding proteins inclusive of the FNZ protein.

[0018] The present invention is based on this novel protein originally derived from S. equi subsp. equi and its potential use for immunization purposes.

BRIEF SUMMARY OF THE INVENTION

[0019] Generally, the present invention is directed to a protein having an amino acid sequence encoded by a nucleic acid sequence or gene, that forms a portion of the genome of S. equi subsp. equi, and which protein binds specifically to mammalian fibronectin.

[0020] The present invention is also directed to an isolated protein, specifically binding to fibronectin, such as mammalian, and specifically equine, fibronectin, and having an amino acid sequence as shown in SEQ. ID. NO. 1.

[0021] Moreover, the present invention is generally concerned with analogs or fragments of the present protein having fibronectin-binding-properties. For instance, a suitable fragment is comprised of the sequence of the present protein, that lacks the N-terminal signal sequence of the preprotein. A further suitable fragment of the present protein lacks a portion of said amino acid sequence, said portion comprising an amino acid sequence binding to a collagen-binding domain of fibronectin.

[0022] The present invention is also concerned with methods to produce the present protein, analogs or fragments thereof, which methods are based on DNA technology; and with nucleic acid sequences, and more specifically DNA sequences or fragments, intended for use in such methods, as well as use of said protein, analogs or fragments thereof for therapeutic purposes, such as immunizing purposes.

[0023] The novel protein has been termed SFS, and, accordingly, the corresponding gene is designated sfs. For the purpose of convenience, these terms are frequently used in the description.

BRIEF DESCRIPTION OF THE DRAWINGS

[0024] In the following, the present invention is disclosed more in detail with reference to the drawings, where

[0025]FIG. 1 (A) shows a map of a clone designated pSFS62 with the gene sfs indicated.

[0026]FIG. 1 (B) shows a schematic presentation of protein SFS with the functional domains indicated. The bars correspond to the amino acid sequences of phagemid clones isolated by panning against Fn (S1-S4). Figures refer to the amino acid positions in protein SFS as shown in SEQ. ID. NO. 1 and the figures within brackets indicate the number of identical clones, that were isolated and sequenced.

[0027]FIG. 2 shows the results of Southern blot analysis of chromosomal DNA from ten streptococcal isolates. The DNA was digested by ApaI and separated by pulsed-field gel electrophoresis in duplicate. The radioactively labeled probe used corresponds to the gene sfs. Lanes: 1, subsp. zooepidemicus ZV; 2, S. dysgalactiae S2; 3, S. equisimilis 172; 4, subsp. equi Bd 3221; 5, subsp. equi Bd 995; 6, subsp. zooepidemicus DSM 20727^(T); 7, subsp. zooepidemicus ATCC 53698; 8, subsp. equi CCUG 11664; 9, subsp. equi NCTC 9682^(T); 10, S. pyogenes AW43. Molecular weight marker (concatamers of lamda) is indicated to the left.

[0028]FIG. 3 shows the results from inhibition assays related to Fn-binding. Cells of subsp. zooepidemicus ZV, subsp. zooepidemicus DSM 20727, subsp. equi Bd3221, and subsp. equi 640 were incubated with iodine-labeled Fn (hatched bars) and with a mixture of iodine-labeled Fn and protein SFS-E (striped bars). The bars represent means of duplicates and the standard deviation is indicated.

[0029]FIG. 4 shows the results from inhibition assays related to inhibition of binding between collagen and Fn with protein SFS. Collagen type I coated microtiter wells were incubated with Fn and a two-fold serial dilution of SFS. Bound Fn was detected by antibodies as described in Example 3. Points represent means of duplicates and the standard deviation is indicated.

DETAILED DESCRIPTION OF THE INVENTION

[0030] More specifically, the present invention is directed to a fibronectin-binding (hereinafter abbreviated Fn-binding) protein, which has an amino acid sequence that can be expressed from a nucleic acid coding sequence, that can be isolated from and forms a portion of the genomes of S. equi, for instance subsp. equi.

[0031] According to a suitable embodiment, the present invention is directed to an isolated protein, specifically binding to fibronectin and having an amino acid sequence as shown in SEQ. ID. NO. 1 below, or a fragment or analog thereof. Met Arg Lys Thr Glu Gly Arg Phe Arg Thr Trp Lys Ser Lys Lys Gln: SEQ. ID. NO. 1 1               5                  10                  15 Trp Leu Phe Ala Gly Ala Val Val Thr Ser Leu Leu Leu Gly Ala Ala            20                  25                  30 Leu Val Phe Gly Gly Leu Leu Gly Ser Leu Gly Gly Ser Ser His Gln        35                 40                 45 Ala Arg Pro Lys Glu Gln Pro Val Ser Ser Ile Gly Asp Asp Asp Lys    50                 55                 60 Ser His Lys Ser Ser Ser Asp Gln Pro Thr Asn His Gln His Gln Ala 65                 70                 75                 80 Thr Ser Pro Ser Gln Pro Thr Ala Lys Ser Ser Gly His His Gly Asn                85                 90                 95 Gln Pro Gln Ser Leu Ser Val Asn Ser Gln Gly Asn Ser Ser Gly Gln            100                105                 110 Ala Ser Glu Pro Gln Ala Ile Pro Asn Gln His His Gln Pro Gln Gly        115                120                125 Lys Pro Gln His Leu Asp Leu Gly Lys Asp Asn Ser Ser Pro Gln Pro    130                135                140 Gln Pro Lys Pro Gln Gly Asn Ser Pro Lys Leu Pro Glu Lys Gly Leu 145                150                155                160 Asn Gly Glu Asn Gln Lys Glu Pro Glu Gln Gly Glu Arg Gly Leu Pro                165                170                175 Gly Leu Asn Gly Glu Asn Gln Lys Glu Pro Glu Gln Gly Glu Arg Gly            180                185                190 Glu Ala Gly Pro Pro Ser Thr Pro Asn Leu Glu Gly Asn Asn Arg Lys        195                200                205 Asn Pro Leu Lys Gly Leu Asp Gly Glu Asn Lys Pro Lys Glu Asp Leu    210                215                220 Asp Gly Tyr Asn His Gly Arg Arg Asp Gly Tyr Arg Val Gly Tyr Glu 225                230                235                240 Asp Gly Tyr Gly Gly Lys Lys His Lys Gly Asp Tyr Pro Lys Arg Phe                245                250                255 Asp GLu Ser Ser Pro Lys GLu Tyr Asn Asp Tyr Ser Gln Gly Tyr Asn            260                265                270 Asp Asn Tyr Gly Asn Gly Asn Pro Asp        275                280

[0032] The present invention is also related to proteins or polypeptides having an amino acid sequence as shown in SEQ. ID. NO. 1 containing deletions or substitutions of amino acids, such as fragments and analogs of the present protein having fibronectin-binding properties, suitably conserved, or specifically designed, Fn-binding properties. One such fragment or analog is comprised of the mature protein lacking the N-terminal amino acids no. 1 to 29 inclusive. Other fragments have en amino acid sequence corresponding to a portion of the amino acid sequence as shown in SEQ. ID. NO. 1 comprising a fibronectin-binding domain or an antigenic determinant or epitope. Still other fragments have an amino acid sequence corresponding to a portion of the sequence as shown in SEQ. ID. NO. 1, wherein an amino acid sequence binding to a collagen-binding domain of fibronectin (Fn) and comprising the amino acids (SEQ. ID. NO. 3) QGERGEAGPP, is deleted.

[0033] A further embodiment is concerned with a protein of the present invention having an amino acid composition of approximately 53 glycine residues, 39 serine residues and 38 proline residues evenly distributed in the protein and optionally 13 tyrosine residues in the C-terminal part of the protein.

[0034] Obviously, the present invention is concerned with a wild-type ptotein encoded by S. equi, that can be isolated and purified when recovered from said organism, as well as with a recombinant SFS protein as discussed above, said proteins having Fn-binding properties.

[0035] The present invention is also concerned with a nucleic acid sequence encoding the SFS protein or fragments or analogs thereof. Suitably, this sequence is a DNA sequence and contains an SFS coding sequence, such as the entire sfs gene, or a portion thereof encoding the SFS protein or a fragment or analog therof.

[0036] Accordingly, one embodiment of the present invention is related to a DNA sequence having a nucleotide sequence as shown in SEQ, ID. NO. 2 or to an equivalent thereof. SEQ. ID. NO. 2 +TL, 44 ATGAGAAAAA CAGAAGGACG TTTTCGCACA TGGAAGTCCA AAAAACAATG GCTATTTGCC: 60 GGTGCAGTGG GAGCTGCACT TGTCTTTGGA GGTTTATTAG GAAGTCTTGG TGGCTCATCC 120 CAGCAGCCAG TCAGCTCGAT TGGAGATGAC GATAAGTCGC ACAAGAGCTC ATCACCACCG 180 AAAAAGGATA ACTTGCAGCC TAAGCCTTCA GATCAGCCTA CTAATCGCCC GTCCCAGCCG 240 ACAGCAAAGA GCTCAGGTCA TCATGGGAAT CAACCTCACC AAGGAAATAG TAGTGGACAG 300 GCCTCAGAGC CTCAGGCTAT TCCTAATCAA GGGCTCAGAG GAGGTAACAG CTCTGGTTCA 360 GGTCATCACC ATCAGCCACA AGATCTAGGT AAGGATAATT CTAGCCCGCA GCCTCAACCA 420 AAGCCTCAGG GCAAAAAAGG CTTGAATGGT GAAAATCAGA AGGAACCGGA GCAAGGTGAA 480 CGAGGTTCAG GGTTGAGTGG TAATAATCAA GGCCGTCCTT CGCTTCCAGG CTTGAATGCA 540 GAGCAAGGTG AACGAGGTGA AGCCGGTCCC CCATCAACTC CGAATTTAGA TCCTTTAAAA 600 GGATTAGATG GAGAGAATAA GCCAAAGGAA GATTTAGACG GTAATGATGA ATCACCAAAA 660 CTTAAAGACG AACACCCCTA CAATCATGGT CGTCGCTATG AGGATGGATA TGGTGGCAAA 720 AAGCACAAAG GAGATTATCC TAAGCGAAAG GAATATAATG ACTATAGTCA AGGGTATAAT 780 GATAATTATG GAAATGGCGA TAGAGGTGGT AAGAGAGGAT ACGGCTATTC TTACAATCCC 840 GACTAA 846

[0037] The present invention is also related DNA sequences having nucleotide substitutions, that do not change the encoded product or interfere with its expression. This is due to the well-known redundancy of the genetic code, i.e. more than one coding nucleotide triplet (codon) can code for or define a particular amino acid residue or a function, such as a stop codon function, etc. Thus, such functionally equivalent sequences are also encompassed by the present invention. Such equivalents may also arise due to spontaneous mutations.

[0038] Accordingly, when used to produce the present protein, such as a protein having the amino acid sequence shown in SEQ. ID. NO. 1, with methods based on recombinant DNA technology, the above DNA sequence may be modified to adapt to the codon frequency of the host organism used to produce the present protein, or fragments or analogs thereof.

[0039] The present invention is also concerned with a host cell comprising a DNA fragment or sequence of the present invention, e.g. a eukaryotic or, suitably, a prokaryotic host cell. A suitable prokaryotic host cell is derived from different strains of E. coli.

[0040] Furthermore, the present invention is concerned with a method to produce the present protein, fragments or analogs thereof comprising culturing a host cell containing the DNA sequence of the present invention, and isolating the expressed protein from the culture. Suitably this method also comprises purification of the expression product, such as affinity chromatography purification, conveniently based on use of “affinity tails”.

[0041] Thus, a suitable method comprises

[0042] (a) introducing a DNA fragment of the present invention into an expression vector;

[0043] (b) introducing the said vector, which contains the said DNA fragment, into a compatible host cell;

[0044] (c) culturing the host cell provided in step (b) under conditions required for expression of the product encoded by said DNA fragment; and

[0045] (d) isolating the expressed product from the cultured host cell.

[0046] Preferably, said method further comprises a step (e), wherein the isolated product from step (d) is purified, e.g. by affinity chromatography, such as Fn-affinity chromatography, or with the use of “affinity tails” as is well-known in this field of art.

[0047] The present invention is further concerned with a vaccine comprising the present Fn-binding protein or a fragment or an analog thereof as an antigenic or immunogenic component. This vaccine is intended for use as a vaccine protecting against infection with any one of the two subspecies equi and zooepidemicus of S. equi.

[0048] A further embodiment of the present invention is concerned with a vaccine as defined above, that protects horses against strangles caused by S. equi subsp. equi infection. Suitably, such a vaccine could protect also against infection with subsp. zooepidemicus.

[0049] The vaccine of the present invention is suitably a subunit vaccine. Moreover, the vaccine may be comprised of a cocktail of antigenic and/and or immunogenic components comprising the present protein or an analog or a fragment thereof as one such component.

[0050] The present invention is also concerned with use of the present Fn-binding protein, analogs or fragments thereof in the preparation of a vaccine protecting against S. equi, for instance a vaccine as disclosed above.

[0051] Furthermore, the present invention is also concerned with the production of antibodies raised against the present protein, analogs or fragments thereof, with fragments of such antibodies and with the production of antisera. Antibodies and/or antisera could be produced by in vivo administration, e.g. injection, of an antigen comprising the said protein, an analog or a fragment thereof, to a host to elicit an immune response in said host, and recovering antiserum thereby produced in said host and, optionally, recovering or isolating antibodies contained in said antiserum or in other body fluids from said host. Not only polyclonal but also monoclonal antibodies could be produced in accordance with well-known methods.

Experimental Part

[0052] Bacterial strains, plasmids, and growth conditions. Ninety-eight clinical isolates of S. equi (50 isolates of subsp. equi strains and 48 isolates of subsp. zooepidemicus strains) collected from different parts of Sweden between 1982-1996 were together with the streptococcal control strains S. dysgalactiae S2 and S. equisimilis 172 obtained from the National Veterinary Institute, Uppsala. The S. pyogenes strain AW-43 was a kind gift from Dr G. Lindahl, Lund University. The plasmid pUC19 was used for cloning purposes and pGEX-5X-2 (Pharmacia Biotech, Uppsala, Sweden) for facilitating purification of proteins. Phagemid pG8SAET (19) was used for purification of protein SFS and for construction of the phage display library. Streptococcal strains were grown on horse blood agar plates or in Todd-Hewitt broth (Oxoid, Basingstoke, UK) supplemented with 0.5% yeast extract (THY). E. coli strains were cultured in Luria-Bertani (LB) medium supplemented in appropriate cases with 50 μg of ampicillin per ml.

[0053] As regards the isolates of S. equi, it is known that isolates of subsp. equi are serologically and genetically very homogeneous whereas isolates of subsp. zooepidemicus display a high degree of heterogeneity (2, 8, 10, 13).

EXAMPLE 1 Cloning and Isolation of a Gene sfs Encoding the Fn-binding Protein SFS.

[0054] A. Construction of a phagemid library. A shotgun phage display library was constructed from subsp. equi Bd 3221 essentially as described by Jacobsson and Frykberg (6). Briefly, chromosomal DNA of this strain was isolated as described earlier (9) and subsequently purified and fragmented by sonication.

[0055] The obtained fragments were treated with T4 DNA polymerase to generate blunt ends and subsequently ligated into SnaBI-digested and dephosphorylated pG8SAET vector. Approximately 4.5×10⁶ ampicillin-resistant transformants were obtained after electrotransformation of the ligated material into E. coli, TG1 cells.

[0056] Twenty randomly picked transformants were all shown to contain inserts. Cells from an overnight culture of the transformants were infected with helper phage R408 and poured together with soft agar onto LA+ampicillin plates (LB medium supplemented with 1.5% agar and 50 μg ampicillin/ml) and incubated overnight. Phage particles were eluted from the soft agar by addition of LB and vigorous shaking. The suspension was centrifugated and the supernatant sterile filtrated. The titer of the library was determined to 7×10¹⁰ CFU/ml.

[0057] This library was used in the following Section B, wherein phage particles containing inserts related to Fn-binding properties are identified.

[0058] B. Panning of the phagemid library. Microtiter wells (Maxisorp, Nunc, Copenhagen, Denmark) were coated with human Fn (Sigma, St. Louis, Mo.) at a concentration of 100 μg/ml in 50 mM sodium carbonate, pH 9.7. The wells were blocked with PBS-0.05% Tween 20 (PBS-T) containing casein (0.1 mg/ml). After washing the wells with PBS-T, the library from Section A above was added to the wells. Thereafter, the wells were extensively washed with PBS-T and then eluted with 140 mM NaCl, 50 mM Na-citrate (pH 2.0).

[0059] To obtain transformed cells containing the Fn-binding insert, eluate was collected, neutralized, infected with E. coli TGI cells and spread on LA plates containing ampicillin. Next day, approximately 1,500 colonies were pooled and after infection with helper phage R408 the sample was mixed with soft agar and poured out on LA plates. After incubation overnight, the phagemid particles were extracted and subjected to another round of panning. After the first panning, 41% of the colonies were found to bind Fn and after the second panning all 180 colonies were positive for Fn-binding.

[0060] C. Screening for Fn-binding clones containing an insert encoding SFS. In this section, the gene encoding the novel Fn-binding protein SFS was screened for based on the knowledge that the previously disclosed fnz gene isolated from subspecies zooepidemicus and encoding the Fn-binding protein FNZ is present also in the genome of subspecies equi. Thus, a negative screening test was used, wherein cells containing inserts related to Fn-binding, i.e. cells positive when screening with a rabbit anti-Fn antibody, but not containing inserts related to Fn-binding of FNZ, i.e. those of the positive cells that were negative when the fnz gene is used as a screening probe, were selected and presumed to contain SFS-related inserts.

[0061] From each panning performed in section B, 180 colonies were transferred in triplicate to LA plates and incubated overnight. The following day, one plate was stored (masterplate) and the colonies from the two remaining plates were transferred to nitrocellulosa filters and incubated for two hours.

[0062] Cells from one filter were lysed using chloroform vapor and after blocking the filter with PBS-T supplemented with casein (0.1 mg/ml), it was incubated with human Fn (1 μg/ml; Sigma) for 2 h, and after washing, a rabbit anti-Fn antibody (diluted 1/1000; Sigma) was added. After 1 h of incubation and washing, the filter was incubated for additional 1 h with a HRP-labeled secondary antibody (diluted 1/1000; Bio-Rad, Richmond, Calif.). Reactive bands were visualized by using 4-chloro-1-naphthol (Serva, Heidelberg, Germany).

[0063] The second filter was subjected to colony hybridization essentially as described in Sambrook et al. (22) with use of a radioactively labeled probe that covered the entire fnz gene and was generated by PCR amplification of chromosomal DNA from subsp. zooepidemicus strain ZV using the primers: 5-fnz, 5′-CGGGATCCCTATTACACATTCTCATCTCATAT (positions 19-42) and (SEQ. ID. NO. 4) 3-fnz, 5′-GGAATTCCAGAAAGCCCGCCTGTAAAC (positions 1954-1935). (SEQ. ID. NO. 5)

[0064] The indicated positions in the respective primers correspond to the published sequence of the genefnz (9).

[0065] In these colony hybridization tests, 41% of Fn-binding clones from the first panning and 30% from the second panning hybridized against the frz gene from subsp. zooepi-demicus ZV used as a probe.

[0066] D. Cloning and isolation of the gene sfs. Clones from Section C, displaying Fn-binding activity but negative in the colony hybridization assay, were selected as candidate sfs-gene-containing clones. Accordingly, these clones were sequenced using thermo sequence dye terminator cycle sequencing pre-mix kit (Amersham) and the ABI Model 377XL DNA sequencer. Computer programs from the PCGENE, DNA, and protein sequence analysis software package (Intelligenetics, Inc., Mountain View, Calif.) were used to record and analyze the sequence data.

[0067] Altogether, eleven Fn-binding but fnz negative clones were analyzed and found to contain inserts identical to one of four different types of inserts, all with overlapping sequences and an open reading frame. These four different phagemid clones designated S1-S4 are shown in FIG. 1B. Based on the overlapping sequences, primers were designed and used to generate a probe consisting of the complete sfs gene using PCR amplification as disclosed in the section concerned with Southern blots below.

[0068] To isolate the complete gene encoding the Fn-binding activity, Southern blot analysis of restriction enzyme digested chromosomal DNA of subsp. equi Bd 3221 was performed as disclosed below and revealed that a 2.6 kb SspI fragment contained sfs. Accordingly, fragments of this size were purified from a preparative agarose gel and ligated into pUC 19. The ligation mix was electroporated into E. coli and transformants were screened for Fn-binding activity as described above. Among several positive clones one, designated pSFS62, was selected and the insert sequenced.

[0069] This clone, pSFS62, had an open reading frame of 1,035 bp, from which the phagemid sequences were found to originate (FIG. 1 and SEQ. ID. NO. 2). The open reading frame is preceded by sequences typical for promoter and ribosome-binding sites (not shown) and is followed by sequences (not shown) resembling a transcriptional termination, suggesting that the gene is translated from a monocistronic messenger. The SFS-coding nucleotide sequence of the sfs gene is shown in SEQ. ID. NO. 2.

[0070] Southern blots. The Southern blot analysis referred to above and further below, was performed according to the following. Agarose imbedded chromosomal DNA digested with ApaI was resolved on 1.2% SeaKem GTG agarose gel (FMC, Rockland, Me.) in 0.5×TBE buffer by PFGE using a Gene Navigator (Pharmacia Biotech, Uppsala, Sweden) as earlier described (10). The DNA was transferred to nylon filters (Hybond-N+, Amersham) by vacuum blotting (VacuGene XL, Pharmacia Biotech) in accordance with the manufacturer's protocol. After cross-linking, the filters were prehybridized for 2 h at 65° C in 6×SSC, 3 × Denhardt's solution, and 0.5% SDS and subsequently incubated with the radioactively labeled sfs probe overnight, using the same conditions. The membranes were washed 3×20 min at 65° C with 0.2×SSC, 0.1% SDS and subjected to autoradiography.

[0071] The probe sfs was generated by PCR amplification of chromosomal DNA from subsp. equi Bd 3221 using the primers: (SEQ. ID. NO. 6) fs5, 5′-ACAAGCCATGGAGCACTTGTCTTTGGAGGT and (SEQ. ID. NO. 7) fr4, 5′-GTCGGGATTGTAAGAATAGCC.

[0072] The single band obtained after agarose gel electrophoresis was purified and random-primed.

EXAMPLE 2 Construction and Purification of SFS as a Fusion Protein.

[0073] The purified PCR-fragment from Example 1 encoding the mature protein of SFS, and described under Southern blots in Example 1, Section D, was digested with NcoI and ligated into SnaBI-NcoI opened pG8SAET. This vector encodes a 13 amino acid peptide tag (E-tag) which facilitates the purification of the recombinant protein using a HiTrap Anti-E tag column (Pharmacia Biotech). The recombinant protein SFS-E was purified from the periplasmic space according to the manufacturer's protocol.

[0074] After cleaving from the E-tag, the SFS protein was obtained having a calculated molecular mass of 40 kDa. The charged amino acids, followed by a stretch of hydrophobic residues in the N-terminal end of the protein, indicate a signal sequence and by the method of von Heijne (14) a possible signal sequence cleavage site was found between amino acids 29 and 30, resulting in a mature protein with a calculated molecular mass of 36 kDa. The isolated Fn-binding phagemid clones contained inserts originating from the central part of the protein, where two repetitive sequences of 21 residues, called R1 and R2, resp., are situated (FIG. 1). Three amino acids were found to dominate the composition of protein SFS, 53 residues are glycines (14.4%), 39 serines (10.6%), and 38 prolines (10.3%). These three amino acids are evenly distributed in the protein in contrast to the 13 tyrosine residues which occur only in the C-terminal part of the protein. Protein SFS does not contain any sequence motifs known to mediate attachment to the bacterial cell-wall.

EXAMPLE 3 Inhibition Assays

[0075] Cells from overnight cultures of streptococci were collected by centrifugation, washed in PBS, and suspended in PBS-0.2% Tween 20 to an optical density at 600 nm of 0.2. In cases of inhibition, 25 nM of affinity purified fusion protein SFS-E was preincubated 15 min with 16 pM of ¹²⁵I-labeled human Fn (91,061 cpm) and thereafter bacteria (500 μl) were added. After two hours incubation at room temperature, the mixtures were centrifuged and the supernatants removed. The radioactivity associated with the pellets was quantified in a gamma counter (LKB Wallac, Turku, Finland). Radioactivity (808 cpm) recovered from a control (tubes that contained no streptococci) was subtracted from each test.

EXAMPLE 4 Expression of the sfs Gene

[0076] RNA was extracted from S. equi cells by using the Blue FastRNA kit (Bio 101, Vista, Calif.) according to the manufacturer's protocol. RNA concentration was determined spectrophotometrically and by visual estimation of the rRNA bands on an agarose gel. RNA (10 μg) was loaded on a formaldehyde-containing agarose gel. RNA was transferred by vacuum-blotting to a positively charged nylon filter (Hybond-N+, Amersham) and cross-linked. Further steps were performed as described for Southern blots above with the exception that ssDNA was added to the pre-hybridization and hybridization solutions.

EXAMPLE 5 The Ability of SFS to Inhibit the Binding Between Collagen and Fn.

[0077] For the enzyme-linked immunosorbent assay (ELISA), polystyrene 96-well microtiter plates were coated for one hour with collagen type I from calf skin (Boehringer, Mannheim, Germany) in PBS. The wells were blocked for one hour with PBS-T supplemented with casein (0.1 mg/ml) and then washed four times with PBS-T. The fusion protein SFS-E was diluted in a two-fold serial and added to the wells together with 0.2 ng of Fn. After 2 h incubation, the wells were washed and a rabbit anti-Fn antibody was added and allowed to bind for 1 h. Finally, the wells were incubated for 1 h with a secondary HRP-labeled antibody. After washing, bound material was quantified by using tetramethylbenzidine (Boehringer) and a microplate reader (Bio-Tek Instruments, Vinooski, Vt.). Measurement was done at a wavelength of 450 nm. Absorbancy in wells without added fusion protein was set to 100% and absorbancy in wells where Fn had been excluded was set to 0%.

RESULTS

[0078] I. The gene sfs is generally present in isolates of subsp. equi. Southern blots performed as disclosed above revealed that a [2 P] dATP-labeled probe, corresponding to the gene sfs, hybridized to all the 50 subsp. equi and to 41 out of 48 subsp. zooepidemicus isolates tested. The results from the hybridization analysis are, for a selected number of strains, shown in FIG. 2. No significantly weak signal, that could not be explained by less chromosomal DNA on the gel, was detected for any of the positive S. equi isolates. The seven isolates of subsp. zooepidemicus that were sfs negative could not be related to each other, considering symptoms, temporal, and geographical origin. Furthermore, the seven negative isolates were obtained from different species, horses (n=4), cows (n=2), and dog (n=1). The sfs probe did not hybridize to any of the three control strains of other streptococcal species (FIG. 2).

[0079] II. Protein SFS displays sequence similarity to both collagen and a potential cell-wall protein of S. pyogenes. Collagen sequences gave highest scores when searching the database Swissprot for SFS-like sequences. The similarity was evenly distributed through protein SFS, and the main reason for the high score is the high content of glycine, serine, and proline, i.e. residues which are also common in collagen. However, a more pronounced similarity was seen for the Fn-binding domain of SFS against collagen. A sequence comparison was also done against the Oklahoma S. pyogenes genomic sequence database, which at the time of search consisted of 98% of the S. pyogenes genome. SFS aligned best against a database sequence which besides high content of glycine and proline residues also displayed the motif (SEQ. ID. NO. 8) QGERGETGP. Eight of these nine residues are present in the Fn-binding domain of SFS. Similar motifs are also present in chains of collagen. Alignment of these sequences are shown in the following Table I. At a closer study of the aligned S. pyogenes sequence, it was found that the aligned motif is situated in the middle of a potential gene, encoding a typical streptococcal cell-surface protein. This statement is based on the following: (i) promoter sequences and a putative ribosome-binding site are present adjacent to an open reading frame, (ii) in the C-terminal part there is a proline-rich domain with the cell-wall anchoring motif (SEQ. ID. NO. 9) LPXTGX, (iii) the LPXTGX motif is directly followed by a stretch of 23 hydrophobic residues and the open reading frame is terminated by six residues whereof three are charged, and (iv) a potential hairpin loop is situated 38 bp downstream the stop codon. However, a start codon in an acceptable distance to the ribosome binding site could not be found. TABLE I SFS (SEQ. ID. NO. 10) QGERGEAGPP S. pyogenes (SEQ. ID. NO. 11) QGERGETGPA Collagen α2 (I) 711 (SEQ. ID. NO. 12) PGERGEVGPA Collagen α1 (I) 991 (SEQ. ID. NO. 13) SGERGPPGPM Collagen α1 (II) 329 (SEQ. ID. NO. 14) PGERGRTGPA Collagen α1 (III) 797 (SEQ. ID. NO. 15) PGERGETGPP Collagen α1 (IV) 319 (SEQ. ID. NO. 16) QGEKGEAGPP

[0080] In this table, alignment of amino acid sequences from different types of collagen and the potential cell-surface protein from S. pyogenes to a motif present in the Fn-binding domain of SFS is illustrated. The figures indicate the number of the first amino acid from the collagen sequences. Bold letters indicate identical residue to the SFS motif.

[0081] III. Inhibition of binding between Fn and cells of S. equi. Recombinant protein SFS was purified by using affinity tails and the purified protein was found to bind Fn in a Western blot assay (data not shown). Before adding iodinated Fn to cells of S. equi the labeled Fn was, in appropriate cases, preincubated with SFS in a molar ratio of 1:1,500. After incubation the cells were collected by centrifugation and after removing the supernatant, the radioactivity bound to the pellets was measured. From the results from the inhibition experiments shown in FIG. 3 it is evident that the protein SFS has, for both subspecies, an inhibitory effect, although the two subsp. equi strains bind considerable less Fn compared to the two subsp. zooepidemicus strains

[0082] IV. Protein SFS inhibits the binding between Fn and collagen. The similarity between protein SFS and collagen suggested that these proteins might bind to the same site on the Fn molecule. In order to investigate this, microtiter wells coated with collagen were incubated with a mixture of Fn and a serial dilution of protein SFS. Bound Fn was detected by an anti-Fn antibody and as seen in FIG. 4, protein SFS inhibits the binding in a concentration dependent way. In a similar assay, the previously known protein FNZ did not inhibit the binding between Fn and collagen (data not shown). Furthermore, protein SFS did not inhibit the binding between the said protein FNZ and Fn, and the protein FNZ did not inhibit the binding between protein SFS and Fn. Protein SFS does not bind collagen. This was tested in order to control that the inhibition of binding between Fn and collagen by protein SFS is dependent on the binding of protein SFS to Fn and not to collagen. Taken together this suggests that protein SFS and the previously known protein FNZ have clearly separate binding sites on the Fn molecule and that protein SFS binds to the 30-40 kDa collagen-binding domain of Fn.

[0083] In the following Example 6, the potential use of the present protein as a vaccine is illustrated in a test wherein the immunogenic properties of the present novel protein are confirmed.

[0084] Example 6. Immunogenic properties of Protein SFS. Affinity purified recombinant protein SFS (Example 2) was, under reducing conditions, subjected to SDS-PAGE on a precasted 8-25% gradient-gel using the PHAST system (Pharmacia Biotech, Sweden). The molecular weight markers used were obtained from BioRad, CA, USA. After electrophoresis was completed, a nitrocellulose (NC) filter (Hybond C, Amersham, UK) previously soaked in PBS was put on the gel and the temperature raised to 45° C. After 45 minutes, the NC-filter was wetted with 1 ml PBS, and removed and placed in 15 ml PBS-T containing casein (0.1 mg/ml) for 1 hour (with two changes of PBS-T casein solution) at room temperature under gentle agitation.

[0085] The gradient-gel was after transfer removed and stained with Coomassie-blue using the PHAST system. The NC-filter was removed and incubated in 5 ml PBS-T casein solution containing 5 μl serum from a horse which previously had got the diagnosis strangles and found to be a carrier of S. equi. After 2 hours incubation at room temperature, under gentle agitation, the filter was extensively washed with PBS-T and incubated in 5 ml PBS-T casein solution containing rabbit anti horse antibodies (Nordic Immunology, Netherlands) at a dilution of 1:1000. After 1 hour of incubation at room temperature, under gentle agitation, the filter was extensively washed with PBS-T and incubated in 5 ml PBS-T casein solution containing horseradish peroxidase labeled goat anti rabbit IgG (BioRad) at a dilution of 1:1000. After 1 hour of incubation at room temperature, under gentle agitation, the filter was extensively washed with PBS-T and PBS.

[0086] To visualize the bound IgG conjugate, the filter was transferred to a solution containing a substrate for peroxidase (containing 25 ml PBS +6 ml 4-chloro-1-naphtol (Sigma, USA, 3 mg/ml in methanol)+20 μl H₂O₂ (35%). After about 15 minutes, the degree of color was measured by eye. The bands appearing on the NC-filter and the bands appearing on the corresponding Coomassie-blue stained PAGE were compared.

[0087] The obtained results clearly showed that (i) the recombinant produced SFS protein was recognized by antibodies present in the serum from the horse with strangles, and (ii) no bands were seen on the NC-filter in the lane with the different molecular weight markers. Thus, this means that protein SFS is expressed by S. equi during the infection process and that this protein is immunogenic

REFERENCES

[0088] 1. Cue, D., P. E. Dombek, H. Lam, and P. P. Cleary. 1998. Streptococcus pyogenes serotype M1 encodes multiple pathways for entry into human epithelial cells. Infect. Immun. 66:4593-4601.

[0089] 2. Galan, J. E., and J. F. Timoney. 1988. Immunologic and genetic comparison of streptococcus equi isolates from the united states and europe. J. Clin. Microbiol. 26:1142-1146.

[0090] 3. Hanski, E., and M. G. Caparon. 1992. Protein F, a fibronectin-binding protein, is an adhesin of the group A streptococcus Streptococcus pyogenes. Proc. Natl. Acad. Sci. USA 89:6172-6176.

[0091] 4. Hanski, E., P. A. Horwitz, and M. G. Caparon. 1992. Expression of protein F, the Fibronectin-binding protein of Streptococcus pyogenes JRS4, in heterologous streptococcal and enterococcal strains promotes their adherence to respiratory epithelial cells. Infect. Immun. 60:5119-5125.

[0092] 5. Hynes, R. O. 1990. Fibronectins. Springer Verlag, New York.

[0093] 6. Jacobsson, K., and L. Frykberg. 1996. Phage display shot-gun cloning of ligand-binding domains of procaryotic receptors approaches 100% correct clones. BioTechniques 20:1070-1081.

[0094] 7. Jadoun, J., V. Ozeri, E. Burstein, E. Skutelsky, E. Hanski, and S. Sela. 1998. Protein F 1 is required for efficient entry of Streptococcus pyogenes into epithelial cells. J. Infect. Dis. 178:147-158

[0095] 8. Jorm, L. R., D. N. Love, G. D. Bailey, G. M. Bailey, and D. A. Briscoe. 1994. Genetic structure of populations of P-haemolytic Lancefield group C streptococci from horses and their association with disease. Res. Vet. Sci. 57:292-299.

[0096] 9. Lindmark, H., K. Jacobsson, L. Frykberg, and B. Guss. 1996. Fibronectin-binding protein of Streptococcus equi subsp. zooepidemicus. Infect. Immun. 64:3993-3999.

[0097] 10. Lindmark, H., P. Jonsson, E. Olsson Engvall, and B. Guss. 1998. Pulsed-field gel electrophoresis and distribution of the genes zag and fnz in isolates of streptococcus equi. In Press.

[0098] 11. Molinari, G., S. R. Talay, P. Valentin-Weigand, M. Rohde, and G. S. Chhatwal. 1997. The fibronectin-binding protein of Streptococcus pyogenes SfbI, is involved in the internalization of group A streptococci by epithelial cells. Infect. Immun. 65:1357-1363.

[0099] 12. Nilsson, M., L. Frykberg, J. I. Flock, L. Pei, M. Lindberg, and B. Guss. 1998. A fibrinogen-binding protein of Staphylococcus epidermidis. Infect. Immun. 66:2666-2673.

[0100] 13. Skjold, S. A., P. G. Quie, L. A. Fries, M. Barnham, and P. P. Cleary. 1987. DNA fingerprinting of streptococcus zooepidemicus (Lancefield group C) as an aid to epidemiological study. J. Infect. Dis. 155:1145-1150.

[0101] 14. von Heijne, G. 1986. A new method for predicting signal sequence cleavage sites. Nucleic Acids Res 14:4683-4690.

[0102] Nucleotide Sequence Accession Number: The nucleotide sequence of the fnz gene (9) is available from the EMBL sequence data bank under accession number X99995. The complete gene sequence (SEQ. ID. NO. 17) and the deduced amino acid sequence (SEQ. ID. NO. 18) of the protein FNZ are shown below:                                               −35                        −10                   RB AAAGAAATCACCTTAAAACTATTACACATTCTCATCTCATATGATATAGTTATTCACCTTTCCTAACCCAAAACATATTAAAAACTTAATATACGGAG 120 4 S                      AGAAGACACTTCAAAACAAAAT L  K  T  K                                                                                ↓ CATTTAGAAAGGTACTGACGACGTCAGCTACCTGTATTGTGCTGCCAACAACCTTT

ACCAACCCTAC

TCTGGGCAGAGCAGCTTTATTAT 240 S  F  A  K  V  L  T  T  S  A  T  C  I  V  L  A  T  S  F  A  G  G  T  L  A  V  W  A  E  Q  L  Y  Y

GGGT

AATCATCCAACCACAC  C  W  N  D  G  T  R AAAGTTCGCCATATTTTTTGTACGTATCCCCTAAAAATCCTCCAACCCTGAATTAAAAGACGAGTATGTTGTTTATTGCTTTAACAAAAAATTCTATT 360 Q  S  S  F  Y  F  L  Y  V  S  P  K  N  A  P  K  R  E  K  D  E  Y  V  V  Y  C  F  N  K  K  L  Y 8

CCCCAGATCAATGGGAATCTA W  P  D  Q  W  E  S TATACACCAATTTTAATGACATCACATCTCCATATAACCATTACCTGTATATGAGAAAAAACTAGGATATGATATTTAAACAATATCCTCCA 480 I  Y  S  N  F  N  D  I  R  S  P  Y  N  D  L  F  V  Y  E  K  K  L  G  Y  D  G  I  F  K  Q  Y  A  P 124 CATTACAAAAAAGATATTAGTG  D  Y  K  K  D  I  S ATATTCCAACTCCTTTCCTCCCACTTTTAAGTAATGGATACCCCA

AACAAGTCACAACTATCAACTACCTACCATTTAAATAATGATTCTT

TAGA 600 D  I  A  S  A  L  V  A  V  L  S  N  G  Y  P  T  N  K  S  Q  L  S  T  S  Y  H  L  N  N  D  S  S  R  R 164 AAAGTTACTCAATTAGCCATTT  K  V  T  Q  L  A  I

TATTTTACTCATAGTTTAACAAAAGAATACCTTAAAGATACGGGTGGTTATAACTTAAACGATATCCAAAAAAAACCTTTACATTTTTTAAT

AGT 720 W  Y  F  S  D  S  L  T  K  E  Y  L  K  D  T  C  C  Y  N  L  N  D  M  E  K  K  A  L  D  F  L  I  

204 AAAGGAGAGGATTCTAAGCTTA  K  C  E  D  S  K  L AATCACACCACACTAATTACTCATTCCATATTTATCTTTATCAAAGTGGCGGGCATGACCATATGAAAGATTACCAAAATCTTCTCGCCTCTACCTTA 840 K  S  

  O  S  N  Y  S  L  D  I  Y  V  Y  C  

  G  C  H  D  H  M  K  D  Y  Q  N  L  L  G  S  T  L 244 → CP               ATTCCTAAAGAACCGCTAAAGC  I  P  K  E  P  L  K                                       → E1 CTCAGCTAGGTCCTTTTAGTCCACATAATCCAAATCCATTAA

TTGAAGGAGGATCATCAGGTTCACAAGAAACTAATCAACATCCTAACAA 960 P  Q  L  C  C  P  S  G  H  N  G  N  G  L  S  G  L  E  G  G  S  

  G  S  Q  E  T  N  E  D  G  K  K

GGACTTATAGGTTTCCATGGAG  G  L  I  C  P  H  C                                                                                             → 

2 GACTCTCAGGAAGCGAGGGCAAACGAGATCCTTTGCCAGGATTGAACCCTCACCCTGGTGCACCTGATACACCTCAAAAGCCTAATGATCCATTGCAA 1080 G  L  S  G  S  E  G  K  R  D  F  L  P  C  L  K  G  R  A  G  A  P  D  T  P  Q  K  P  N  D  P  L  Q 324 GCTCTTCAACCCGGTAACTCTC  G  L  E  G  G  N  E          → A2 CTATAGTACAACAAAACTATGGTAGTACCCAAGGATATCATGGTCAATCAGGCATTCTTGAGCAAACCCAACATACTAACCCACCTGGTATCATACTA 1200 P  I  V  E  Q  N  Y  G  S  T  E  G  Y  H  G  Q  S  C  I  L  E  E  T  E  D  T  N  F  F  G  I  I  L 364 → R1                  GGCGGCTCACCAAATGTCGAAA  C  G  S  G  N  V  E                                                                                                   → R2 CGCATGAACATACTACAACCCTCATCTGATCCCCATCGGCGGGGGTCTAGCTGGCGAATCAGGAGAACGACACCTAAACCACCACAAACCGGTGGG 1320 T  H  E  D  T  E  N  D  M  L  M  G  I  G  G  G  L  A  G  E  S  C  E  T  T  P  K  P  G  Q  T  G  G 404 CAAGGACCAGTCATCGAGACAA  Q  C  P  V  I  E  T                                                                                      → R3 CAGAGGATACACAAAAAGGCATGTCTGCACAATCTGGTGGCACTAT

AGTCAGAAAACACCAAAAAGCCGGAGGTCATGATTGGTGGTCACCCACAA 1440 T  E  D  T  Q  K  C  

  

  C  Q  

  C  G  T  I  

  

  E  N  T  K  K  P  E  V  M  I  G  G  Q  G  Q 444 ACCATCGAGACAACAGAGGACA  T  I  E  T  T  E  D                                                                            → R4 CACAAAAAGGCATGTCTGGACAATCTGGCGGTACTATCGAGTCAGAGCACACTAACAAACCTCAGGTCATCATTGGTGGTCAGGGACAAATCATTGAC 1560 T  Q  K  G  M  E  G  G  E  C  C  T  I  E  S  E  D  T  K  K  P  E  V  M  I  G  G  Q  G  Q  I  I  D 484 TTCTGAAAACACCATCAG  P  S  E  N  T  G  S                                                                   → R5 CTATGTCTCCCCACTCTGGTCACACTACGGTAATTGAGGATACCAAGAAGTCTGACATAATCATTCCTCCCCAACCACAAATCATGGACTTCTCTGAG 1660

  M  S  G  Q  S  G  D  T  T  V  I  E  D  T  K  K  S  E  I  I  I  G  G  Q  G  Q  I  I  D  P  E  E 524 GATACTCACCCGGGTATGTCTG  D  T  Q  P  G  M  S                                              → W GTCAATCTGGACGCACTACAATTGTGGAAGACACCAAGAAGCCGAGACCTAAGCCTAAACCTGCACCTGCGCCAATTCTTAATCACCAAAAACCTAAC 1810 C  Q  E  C  C  T  T  I  V  E  D  T  K  K  F  T  F  K  F  K  P  A  P  A  P  I  V  N  D  E  K  F  N 554 AAAGGCACTCATCTCCCACAGA  K  G  T  H  L  P  Q                  → 

CAAGTGATATGAAGCAACTCACCCTAAGCATCATCCCTGCAATCTCAATCTCAATGCTGCTTGTCCTATGTCTGTCTCTATTCAAGCGACCATCTAAAAAAGAG 1920 T  E  D  M  K  C  L  T  L  S  I  I  G  A  M  S  M  L  L  V  L  C  L  S  L  F  K  R  D  S  K  K  D 597 TAAGCAACACTGACAAGTGATC *                     ATTATTGTACGGACGTTTACAGGCGGGCTTTCTCCCCTCTTAAAACACCGTACAACACCAAAACGGTCTAAATAGAAATATCCCTCAAGGCTTAGACA ATCGCTCTAGCCCTTGGGGGCT TTTTTTCACATTTATAATAGAAGAGAGCAAGCAGTGACACTTTCTTTTTGCCCTATATCTGTTAACACTAGCGGTATGTGCAGTTAC

[0103]

1 18 1 281 PRT Streptococcus equi 1 Met Arg Lys Thr Glu Gly Arg Phe Arg Thr Trp Lys Ser Lys Lys Gln 1 5 10 15 Trp Leu Phe Ala Gly Ala Val Val Thr Ser Leu Leu Leu Gly Ala Ala 20 25 30 Leu Val Phe Gly Gly Leu Leu Gly Ser Leu Gly Gly Ser Ser His Gln 35 40 45 Ala Arg Pro Lys Glu Gln Pro Val Ser Ser Ile Gly Asp Asp Asp Lys 50 55 60 Ser His Lys Ser Ser Ser Asp Gln Pro Thr Asn His Gln His Gln Ala 65 70 75 80 Thr Ser Pro Ser Gln Pro Thr Ala Lys Ser Ser Gly His His Gly Asn 85 90 95 Gln Pro Gln Ser Leu Ser Val Asn Ser Gln Gly Asn Ser Ser Gly Gln 100 105 110 Ala Ser Glu Pro Gln Ala Ile Pro Asn Gln His His Gln Pro Gln Gly 115 120 125 Lys Pro Gln His Leu Asp Leu Gly Lys Asp Asn Ser Ser Pro Gln Pro 130 135 140 Gln Pro Lys Pro Gln Gly Asn Ser Pro Lys Leu Pro Glu Lys Gly Leu 145 150 155 160 Asn Gly Glu Asn Gln Lys Glu Pro Glu Gln Gly Glu Arg Gly Leu Pro 165 170 175 Gly Leu Asn Gly Glu Asn Gln Lys Glu Pro Glu Gln Gly Glu Arg Gly 180 185 190 Glu Ala Gly Pro Pro Ser Thr Pro Asn Leu Glu Gly Asn Asn Arg Lys 195 200 205 Asn Pro Leu Lys Gly Leu Asp Gly Glu Asn Lys Pro Lys Glu Asp Leu 210 215 220 Asp Gly Tyr Asn His Gly Arg Arg Asp Gly Tyr Arg Val Gly Tyr Glu 225 230 235 240 Asp Gly Tyr Gly Gly Lys Lys His Lys Gly Asp Tyr Pro Lys Arg Phe 245 250 255 Asp Glu Ser Ser Pro Lys Glu Tyr Asn Asp Tyr Ser Gln Gly Tyr Asn 260 265 270 Asp Asn Tyr Gly Asn Gly Asn Pro Asp 275 280 2 846 DNA Streptococcus equi 2 atgagaaaaa cagaaggacg ttttcgcaca tggaagtcca aaaaacaatg gctatttgcc 60 ggtgcagtgg gagctgcact tgtctttgga ggtttattag gaagtcttgg tggctcatcc 120 cagcagccag tcagctcgat tggagatgac gataagtcgc acaagagctc atcaccaccg 180 aaaaaggata acttgcagcc taagccttca gatcagccta ctaatcgccc gtcccagccg 240 acagcaaaga gctcaggtca tcatgggaat caacctcacc aaggaaatag tagtggacag 300 gcctcagagc ctcaggctat tcctaatcaa gggctgagag gaggtaacag ctctggttca 360 ggtcatcacc atcagccaca agatctaggt aaggataatt ctagcccgca gcctcaacca 420 aagcctcagg gcaaaaaagg cttgaatggt gaaaatcaga aggaaccgga gcaaggtgaa 480 cgaggttcag ggttgagtgg taataatcaa ggccgtcctt cgcttccagg cttgaatgca 540 gagcaaggtg aacgaggtga agccggtccc ccatcaactc cgaatttaga tcctttaaaa 600 ggattagatg gagagaataa gccaaaggaa gatttagacg gtaatgatga atcaccaaaa 660 cttaaagacg aacaccccta caatcatggt cgtcgctatg aggatggata tggtggcaaa 720 aagcacaaag gagattatcc taagcgaaag gaatataatg actatagtca agggtataat 780 gataattatg gaaatggcga tagaggtggt aagagaggat acggctattc ttacaatccc 840 gactaa 846 3 10 PRT Streptococcus equi 3 Gln Gly Glu Arg Gly Glu Ala Gly Pro Pro 1 5 10 4 32 DNA Artificial Sequence Derived from Streptococcus equi subspecies zooepidemicus 4 cgggatccct attacacatt ctcatctcat at 32 5 27 DNA Artificial Sequence Derived from Streptococcus equi subspecies zooepidemicus 5 ggaattccag aaagcccgcc tgtaaac 27 6 30 DNA Artificial Sequence Derived from Streptococcus equi subspecies equi Bd 3221 6 acaagccatg gagcacttgt ctttggaggt 30 7 21 DNA Artificial Sequence Derived from Streptococcus equi subspecies equi Bd 3221 7 gtcgggattg taagaatagc c 21 8 9 PRT Streptococcus pyogenes 8 Gln Gly Glu Arg Gly Glu Thr Gly Pro 1 5 9 6 PRT Streptococcus pyogenes UNSURE (3)..(3) Xaa equals any amino acid, unknown, or other 9 Leu Pro Xaa Thr Gly Xaa 1 5 10 10 PRT Streptococcus equi 10 Gln Gly Glu Arg Gly Glu Ala Gly Pro Pro 1 5 10 11 10 PRT Streptococcus pyogenes 11 Gln Gly Glu Arg Gly Glu Thr Gly Pro Ala 1 5 10 12 10 PRT unknown Collagen alpha2 (I) 711 12 Pro Gly Glu Arg Gly Glu Val Gly Pro Ala 1 5 10 13 10 PRT unknown Collagen alpha1 (I) 991 13 Ser Gly Glu Arg Gly Pro Pro Gly Pro Met 1 5 10 14 10 PRT unknown Collagen alpha1 (II) 329 14 Pro Gly Glu Arg Gly Arg Thr Gly Pro Ala 1 5 10 15 10 PRT unknown Collagen alpha1 (III) 797 15 Pro Gly Glu Arg Gly Glu Thr Gly Pro Pro 1 5 10 16 10 PRT unknown Collagen alpha1 (IV) 319 16 Gln Gly Glu Lys Gly Glu Ala Gly Pro Pro 1 5 10 17 2127 DNA Streptococcus equi CDS (108)..(1901) 17 aaagaaatca gcttaaaact attacacatt ctcatctcat atgatatagt tattcaggtt 60 tcctaacgga aaacatatta aaaacttaat atacggagag aagagag ttg aaa aca 116 Leu Lys Thr 1 aaa tca ttt aga aag gta ctg acg acg tca gct acc tgt att gtg ctg 164 Lys Ser Phe Arg Lys Val Leu Thr Thr Ser Ala Thr Cys Ile Val Leu 5 10 15 gca aca agc ttt gcc gga gga acc cta cgc gtc tgg gca gag cag ctt 212 Ala Thr Ser Phe Ala Gly Gly Thr Leu Arg Val Trp Ala Glu Gln Leu 20 25 30 35 tat tat ggg tgg aat gat gga acg aga caa agt tcg cca tat ttt ttg 260 Tyr Tyr Gly Trp Asn Asp Gly Thr Arg Gln Ser Ser Pro Tyr Phe Leu 40 45 50 tac gta tcg cct aaa aat gct cca aag cgt gaa tta aaa gac gag tat 308 Tyr Val Ser Pro Lys Asn Ala Pro Lys Arg Glu Leu Lys Asp Glu Tyr 55 60 65 gtt gtt tat tgc ttt aac aaa aaa ttg tat tgg cca gat caa tgg gaa 356 Val Val Tyr Cys Phe Asn Lys Lys Leu Tyr Trp Pro Asp Gln Trp Glu 70 75 80 tct ata tac agc aat ttt aat gac atc aga tct cca tat aac gat tta 404 Ser Ile Tyr Ser Asn Phe Asn Asp Ile Arg Ser Pro Tyr Asn Asp Leu 85 90 95 cct gta tat gag aaa aaa cta gga tat gat ggt ata ttt aaa caa tat 452 Pro Val Tyr Glu Lys Lys Leu Gly Tyr Asp Gly Ile Phe Lys Gln Tyr 100 105 110 115 gct cca gat tac aaa aaa gat att agt gat att gca agt gct ttg gtg 500 Ala Pro Asp Tyr Lys Lys Asp Ile Ser Asp Ile Ala Ser Ala Leu Val 120 125 130 gca gtt tta agt aat gga tac ccc act aac aag tca caa cta tca act 548 Ala Val Leu Ser Asn Gly Tyr Pro Thr Asn Lys Ser Gln Leu Ser Thr 135 140 145 agc tac cat tta aat aat gat tct tct aga aaa gtt act caa tta gcc 596 Ser Tyr His Leu Asn Asn Asp Ser Ser Arg Lys Val Thr Gln Leu Ala 150 155 160 att tgg tat ttt agt gat agt tta aca aaa gaa tac ctt aaa gat acg 644 Ile Trp Tyr Phe Ser Asp Ser Leu Thr Lys Glu Tyr Leu Lys Asp Thr 165 170 175 ggt ggt tat aac tta aac gat atg gaa aaa aaa gct tta gat ttt tta 692 Gly Gly Tyr Asn Leu Asn Asp Met Glu Lys Lys Ala Leu Asp Phe Leu 180 185 190 195 atc agt aaa gga gag gat tct aag ctt aaa tca gag cag agt aat tac 740 Ile Ser Lys Gly Glu Asp Ser Lys Leu Lys Ser Glu Gln Ser Asn Tyr 200 205 210 tca ttg gat att tat gtt tat caa agt ggc ggg cat gac cat atg aaa 788 Ser Leu Asp Ile Tyr Val Tyr Gln Ser Gly Gly His Asp His Met Lys 215 220 225 gat tac caa aat ctt ctc ggc tct acc tta att cct aaa gaa ccg cta 836 Asp Tyr Gln Asn Leu Leu Gly Ser Thr Leu Ile Pro Lys Glu Pro Leu 230 235 240 aag cct cag cta ggt ggt ttt agt gga cat aat gga aat gga tta agc 884 Lys Pro Gln Leu Gly Gly Phe Ser Gly His Asn Gly Asn Gly Leu Ser 245 250 255 ggc ctt gaa gga gga tca tca ggt tca caa gaa act aat gaa gat ggt 932 Gly Leu Glu Gly Gly Ser Ser Gly Ser Gln Glu Thr Asn Glu Asp Gly 260 265 270 275 aag aaa gga ctt ata ggt ttc cat gga gga ctc tca gga agc gag ggc 980 Lys Lys Gly Leu Ile Gly Phe His Gly Gly Leu Ser Gly Ser Glu Gly 280 285 290 aaa cga gat cct ttg cca gga ttg aag ggt gag gct ggt gca cct gat 1028 Lys Arg Asp Pro Leu Pro Gly Leu Lys Gly Glu Ala Gly Ala Pro Asp 295 300 305 aca cct caa aag cct aat gat cca ttg caa ggt ctt gaa ggc ggt aac 1076 Thr Pro Gln Lys Pro Asn Asp Pro Leu Gln Gly Leu Glu Gly Gly Asn 310 315 320 tct cct ata gta gaa caa aac tat ggt agt acc gaa gga tat cat ggt 1124 Ser Pro Ile Val Glu Gln Asn Tyr Gly Ser Thr Glu Gly Tyr His Gly 325 330 335 caa tca ggc att ctt gag gaa acc gaa gat act aac cca cct ggt atc 1172 Gln Ser Gly Ile Leu Glu Glu Thr Glu Asp Thr Asn Pro Pro Gly Ile 340 345 350 355 ata cta ggc ggc tca gga aat gtc gaa acg cat gaa gat act aga aac 1220 Ile Leu Gly Gly Ser Gly Asn Val Glu Thr His Glu Asp Thr Arg Asn 360 365 370 cct cat ctg atg ggg atc ggc ggc ggt cta gct ggc gaa tca gga gaa 1268 Pro His Leu Met Gly Ile Gly Gly Gly Leu Ala Gly Glu Ser Gly Glu 375 380 385 acg aca cct aaa cca gga caa acc ggc ggg caa gga cca gtc atc gag 1316 Thr Thr Pro Lys Pro Gly Gln Thr Gly Gly Gln Gly Pro Val Ile Glu 390 395 400 aca aca gag gat aca caa aaa ggc atg tct gga caa tct ggt ggc act 1364 Thr Thr Glu Asp Thr Gln Lys Gly Met Ser Gly Gln Ser Gly Gly Thr 405 410 415 atc gag tca gaa aac acc aaa aag ccg gag gtc atg att ggt ggt cag 1412 Ile Glu Ser Glu Asn Thr Lys Lys Pro Glu Val Met Ile Gly Gly Gln 420 425 430 435 gga caa acc atc gag aca aca gag gac aca caa aaa ggc atg tct gga 1460 Gly Gln Thr Ile Glu Thr Thr Glu Asp Thr Gln Lys Gly Met Ser Gly 440 445 450 caa tct ggc ggt act atc gag tca gag gac act aag aaa cct gag gtc 1508 Gln Ser Gly Gly Thr Ile Glu Ser Glu Asp Thr Lys Lys Pro Glu Val 455 460 465 atg att ggt ggt cag gga caa atc atc gac ttc tct gaa aac acc caa 1556 Met Ile Gly Gly Gln Gly Gln Ile Ile Asp Phe Ser Glu Asn Thr Gln 470 475 480 tca ggt atg tct ggg cag tct ggt gac act acg gta att gag gat acc 1604 Ser Gly Met Ser Gly Gln Ser Gly Asp Thr Thr Val Ile Glu Asp Thr 485 490 495 aag aag tct gag ata atc att ggt ggg caa gga caa atc atc gac ttc 1652 Lys Lys Ser Glu Ile Ile Ile Gly Gly Gln Gly Gln Ile Ile Asp Phe 500 505 510 515 tct gag gat act cag ccg ggt atg tct ggt caa tct gga ggc act aca 1700 Ser Glu Asp Thr Gln Pro Gly Met Ser Gly Gln Ser Gly Gly Thr Thr 520 525 530 att gtc gaa gac acc aag aag ccg aca cct aag cct aaa cct gca cct 1748 Ile Val Glu Asp Thr Lys Lys Pro Thr Pro Lys Pro Lys Pro Ala Pro 535 540 545 gcg cca att gtt aat gac gaa aaa cct aac aaa ggc act cat ctc cca 1796 Ala Pro Ile Val Asn Asp Glu Lys Pro Asn Lys Gly Thr His Leu Pro 550 555 560 cag aca agt gat atg aag caa ctc acc cta agc atc atc ggt gca atg 1844 Gln Thr Ser Asp Met Lys Gln Leu Thr Leu Ser Ile Ile Gly Ala Met 565 570 575 tca atg ctg ctt gtc cta tgt ctg tct cta ttc aag cga cca tct aaa 1892 Ser Met Leu Leu Val Leu Cys Leu Ser Leu Phe Lys Arg Pro Ser Lys 580 585 590 595 aaa gac taa gcaacactga caagtgatca ttattgtacg gacgtttaca 1941 Lys Asp ggcgggcttt ctgccctctt aaaacaccgt acaagagcaa aacggtctaa atagaaatat 2001 ccctcaaggg ttagacaatc gctctagccc ttgggggctt tttttgacat ttataataga 2061 agagagcaag cagtgacact ttctttttgc cctatatctg ttaacactag cggtatgtgc 2121 agttac 2127 18 597 PRT Streptococcus equi 18 Leu Lys Thr Lys Ser Phe Arg Lys Val Leu Thr Thr Ser Ala Thr Cys 1 5 10 15 Ile Val Leu Ala Thr Ser Phe Ala Gly Gly Thr Leu Arg Val Trp Ala 20 25 30 Glu Gln Leu Tyr Tyr Gly Trp Asn Asp Gly Thr Arg Gln Ser Ser Pro 35 40 45 Tyr Phe Leu Tyr Val Ser Pro Lys Asn Ala Pro Lys Arg Glu Leu Lys 50 55 60 Asp Glu Tyr Val Val Tyr Cys Phe Asn Lys Lys Leu Tyr Trp Pro Asp 65 70 75 80 Gln Trp Glu Ser Ile Tyr Ser Asn Phe Asn Asp Ile Arg Ser Pro Tyr 85 90 95 Asn Asp Leu Pro Val Tyr Glu Lys Lys Leu Gly Tyr Asp Gly Ile Phe 100 105 110 Lys Gln Tyr Ala Pro Asp Tyr Lys Lys Asp Ile Ser Asp Ile Ala Ser 115 120 125 Ala Leu Val Ala Val Leu Ser Asn Gly Tyr Pro Thr Asn Lys Ser Gln 130 135 140 Leu Ser Thr Ser Tyr His Leu Asn Asn Asp Ser Ser Arg Lys Val Thr 145 150 155 160 Gln Leu Ala Ile Trp Tyr Phe Ser Asp Ser Leu Thr Lys Glu Tyr Leu 165 170 175 Lys Asp Thr Gly Gly Tyr Asn Leu Asn Asp Met Glu Lys Lys Ala Leu 180 185 190 Asp Phe Leu Ile Ser Lys Gly Glu Asp Ser Lys Leu Lys Ser Glu Gln 195 200 205 Ser Asn Tyr Ser Leu Asp Ile Tyr Val Tyr Gln Ser Gly Gly His Asp 210 215 220 His Met Lys Asp Tyr Gln Asn Leu Leu Gly Ser Thr Leu Ile Pro Lys 225 230 235 240 Glu Pro Leu Lys Pro Gln Leu Gly Gly Phe Ser Gly His Asn Gly Asn 245 250 255 Gly Leu Ser Gly Leu Glu Gly Gly Ser Ser Gly Ser Gln Glu Thr Asn 260 265 270 Glu Asp Gly Lys Lys Gly Leu Ile Gly Phe His Gly Gly Leu Ser Gly 275 280 285 Ser Glu Gly Lys Arg Asp Pro Leu Pro Gly Leu Lys Gly Glu Ala Gly 290 295 300 Ala Pro Asp Thr Pro Gln Lys Pro Asn Asp Pro Leu Gln Gly Leu Glu 305 310 315 320 Gly Gly Asn Ser Pro Ile Val Glu Gln Asn Tyr Gly Ser Thr Glu Gly 325 330 335 Tyr His Gly Gln Ser Gly Ile Leu Glu Glu Thr Glu Asp Thr Asn Pro 340 345 350 Pro Gly Ile Ile Leu Gly Gly Ser Gly Asn Val Glu Thr His Glu Asp 355 360 365 Thr Arg Asn Pro His Leu Met Gly Ile Gly Gly Gly Leu Ala Gly Glu 370 375 380 Ser Gly Glu Thr Thr Pro Lys Pro Gly Gln Thr Gly Gly Gln Gly Pro 385 390 395 400 Val Ile Glu Thr Thr Glu Asp Thr Gln Lys Gly Met Ser Gly Gln Ser 405 410 415 Gly Gly Thr Ile Glu Ser Glu Asn Thr Lys Lys Pro Glu Val Met Ile 420 425 430 Gly Gly Gln Gly Gln Thr Ile Glu Thr Thr Glu Asp Thr Gln Lys Gly 435 440 445 Met Ser Gly Gln Ser Gly Gly Thr Ile Glu Ser Glu Asp Thr Lys Lys 450 455 460 Pro Glu Val Met Ile Gly Gly Gln Gly Gln Ile Ile Asp Phe Ser Glu 465 470 475 480 Asn Thr Gln Ser Gly Met Ser Gly Gln Ser Gly Asp Thr Thr Val Ile 485 490 495 Glu Asp Thr Lys Lys Ser Glu Ile Ile Ile Gly Gly Gln Gly Gln Ile 500 505 510 Ile Asp Phe Ser Glu Asp Thr Gln Pro Gly Met Ser Gly Gln Ser Gly 515 520 525 Gly Thr Thr Ile Val Glu Asp Thr Lys Lys Pro Thr Pro Lys Pro Lys 530 535 540 Pro Ala Pro Ala Pro Ile Val Asn Asp Glu Lys Pro Asn Lys Gly Thr 545 550 555 560 His Leu Pro Gln Thr Ser Asp Met Lys Gln Leu Thr Leu Ser Ile Ile 565 570 575 Gly Ala Met Ser Met Leu Leu Val Leu Cys Leu Ser Leu Phe Lys Arg 580 585 590 Pro Ser Lys Lys Asp 595 

1. An isolated protein, an analog thereof, or a fragment thereof, having an amino acid sequence encoded by a nucleic acid sequence, that can be isolated from and forms a portion of the genomes of Streptococcus equi, which protein can be expressed from said nucleic acid sequence and which protein binds specifically to mammal fibronectin, provided that the amino acid sequence shown in SEQ ID NO. 18 is excluded.
 2. An isolated protein specifically binding to fibronectin and having an amino acid sequence as shown in SEQ. ID. NO. 1, or an analog or fragment thereof.
 3. The protein of claim 2 having an amino acid sequence as shown in SEQ. ID. NO. 1 containing deletions or substitutions of amino acids.
 4. The protein of claim 2, wherein the protein is a fragment comprised of the amino acid sequence of SEQ. ID. NO. 1 that lacks an N-terminal sequence, suitably amino acids no 1-19 inclusive in the sequence of SEQ. ID. NO.
 1. 5. The protein of claim 3, which has an amino acid sequence corresponding to a portion of the sequence as shown in SEQ. ID. NO. 1 wherein an amino acid sequence binding to a collagen-binding domain of fibronectin (Fn) and comprising the sequence consisting of amino acids (SEQ. ID. NO. 3) QGERGEAGPP, is deleted.
 6. The protein of claim 1, wherein the protein has an amino acid composition of 53 glycine residues, 39 serine residues and 38 proline residues evenly distributed in the protein and optionally 13 tyrosine residues in the C-terminal part of the protein.
 7. A DNA fragment comprising a nucleotide sequence coding for a protein according to claim
 1. 8. The DNA fragment of claim 7, wherein said fragment has a nucleotide sequence as shown in SEQ, ID. NO. 2 or an equivalent thereof.
 9. A recombinant DNA molecule comprising a replicable vector, which suitably is an expression vector, and a DNA fragment according to claim 7 or 8 inserted therein.
 10. A host cell comprising a DNA fragment in accordance with claim
 7. 11. The host cell of claim 10, wherein said cell is a prokaryotic host cell, suitably a prokaryotic host cell comprised of a strain of E. coli.
 12. A method of producing the protein of claim 1, or fragments or analogs thereof comprising culturing a host cell as defined in claim 10, and isolating the expression product comprising the protein from the culture.
 13. The method of claim 12, wherein said method further comprises purification of the expression product, such as by affinity chromatography.
 14. The method of claim 12, which method comprises (a) introducing the DNA fragment encoding the protein or fragment or analog thereof into an expression vector; (b) introducing the said vector, which contains the said DNA fragment, into a compatible host cell; (c) culturing the host cell provided in step (b) under conditions required for expression of the product encoded by said DNA fragment; and (d) isolating the expressed product from the cultured host cell, and, optionally, (e) purifying the isolated product from step (d) by affinity chromatography or other chromatographic methods known in the art.
 15. A vaccine comprising the fibronectin-binding protein or a fragment or an analog thereof as defined in any one of claims 1-6 or as produced by a method as defined in any one of claims 12-14.
 16. The vaccine of claim 15, which vaccine is a vaccine that protects horses against strangles caused by S. equi infection.
 17. An antibody specific for a fibronectin-binding protein of any one of claims 1-6 or a fragment or an analog thereof, which antibody is polyclonal or monoclonal, or a fragment of said antibody.
 18. An antigenic preparation comprising an antigen consisting of the protein of any one of claims 1-6 or a fragment or an analog thereof.
 19. An antiserum comprising an antibody of claim 17, which is comprised of a polyclonal antibody.
 20. A method for the production of an antiserum, said method comprising administering an antigenic preparation of claim 18 to an animal host to produce antibodies in said animal host and recovering antiserum containing said antibodies produced in said host animal.
 21. A method of prophylactic or therapeutic treatment of S. equi infection in mammals, suitably horses, comprising administering an immunologically effective amount of a vaccine of claim 15 or 16, an antibody of claim 17, or an antiserum of claim
 19. 